AMENDMENTS TO THE CLAIMS 
Claim 1 (Currently Amended) A video processing apparatus for specifying frames of 
content to be start frames of a plurality of viewing segments of the content, when segmenting 
[[a]] the content into the plurality of viewing segments, the video processing apparatus
comprising: 

a specifying information memory storing a plurality of pieces of specifying information,
each piece of specifying information of the plurality of pieces of specifying information (i) [[each
showing a feature of frames to be specified as start frames and each]] corresponding to a different
type of content, and (ii) including:

a first condition identifying a feature of frames of the content to be detected as 
candidates for presentation frames, each of the presentation frames for being displayed as a 
representative still image of a respective viewing segment of the plurality of viewing segments; 

an exclusion condition identifying a feature of frames of the content to be 
excluded from being the candidates for the presentation frames;

a second condition identifying a feature of frames of the content to be detected as 
candidates for start frames; and 

a selection condition identifying a relationship between a presentation frame of 
the content and a frame of the content that is to be selected as a start frame;

a content obtaining unit operable to obtain a content;

an information obtaining unit operable to obtain type information identifying [[showing]] the
type of the obtained content;

an extracting unit operable to extract, from the specifying information memory, a piece of
specifying information, of the plurality of pieces of specifying information, that corresponds
[[corresponding]] to the type of the content identified [[shown]] by the obtained type information; and

a specifying unit operable, in accordance with the extracted piece of specifying
information, to (i) specify the presentation frames of the content by detecting, from all frames of
the content, frames of the content satisfying the first condition and by subsequently excluding, 
from the detected frames satisfying the first condition, frames satisfying the exclusion condition, 
and (ii) specify start frames of the content by detecting, from all frames of the content, frames of 
the content satisfying the second condition and by subsequently selecting, from the detected 
frames satisfying the second condition, frames satisfying the relationship identified by the 
selection condition with respect to the specified presentation frames [[start frames present in the
content, in accordance with the extracted piece of specifying information]].
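
As an informal illustration only, the two-stage flow recited in Claim 1 (detect, then exclude, for presentation frames; detect, then select, for start frames) can be sketched as follows. Everything named here is an assumption made for the sketch and does not come from the claims: frames are plain integer indices, each condition is a Python callable, and `SpecifyingInfo`, `specify_frames`, and `closest_at_or_before` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Optional, Tuple

Frame = int  # assumption: a frame is identified by its index in the content


@dataclass
class SpecifyingInfo:
    """One piece of specifying information (hypothetical representation)."""
    first_condition: Callable[[Frame], bool]      # presentation-frame candidates
    exclusion_condition: Callable[[Frame], bool]  # candidates to drop
    second_condition: Callable[[Frame], bool]     # start-frame candidates
    # Selection condition: picks a start frame for one presentation frame.
    selection_condition: Callable[[Frame, List[Frame]], Optional[Frame]]


def specify_frames(
    frames: Iterable[Frame],
    content_type: str,
    memory: Dict[str, SpecifyingInfo],
) -> Tuple[List[Frame], List[Frame]]:
    frames = list(frames)
    info = memory[content_type]  # extract the piece matching the content type

    # (i) detect by the first condition, then exclude by the exclusion condition
    detected = [f for f in frames if info.first_condition(f)]
    presentation = [f for f in detected if not info.exclusion_condition(f)]

    # (ii) detect by the second condition, then select per presentation frame
    candidates = [f for f in frames if info.second_condition(f)]
    starts: List[Frame] = []
    for p in presentation:
        chosen = info.selection_condition(p, candidates)
        if chosen is not None:
            starts.append(chosen)
    return presentation, starts


def closest_at_or_before(p: Frame, candidates: List[Frame]) -> Optional[Frame]:
    """Toy selection condition: nearest candidate at or before frame p."""
    earlier = [c for c in candidates if c <= p]
    return max(earlier) if earlier else None


# Toy data: the conditions below are arbitrary stand-ins for real detectors.
memory = {
    "news": SpecifyingInfo(
        first_condition=lambda f: f % 5 == 0,
        exclusion_condition=lambda f: f == 10,
        second_condition=lambda f: f % 3 == 0,
        selection_condition=closest_at_or_before,
    )
}
presentation, starts = specify_frames(range(20), "news", memory)
```

Keying the memory by content type mirrors the claim's requirement that each piece of specifying information corresponds to a different type of content; a real apparatus would replace the lambdas with actual image and audio detectors.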

Claim 2 (Cancelled) 

Claim 3 (Currently Amended) The video processing apparatus of Claim 1 [[Claim 2]], further
comprising[[:]] an index storage unit operable to store, in correspondence with the content, a
respective display time [[times]] of each of the specified start frames of the content [[frame]] and each
of the specified presentation frames of the content [[frame specified by the specifying unit]].

Claim 4 (Cancelled) 

Claim 5 (Cancelled)

Claim 6 (Currently Amended) The video processing apparatus of Claim 1 [[Claim 5]],
wherein the specifying unit includes: 

a plurality of detecting subunits, each detecting subunit of the plurality of
detecting subunits being operable to detect frames of the content having a different feature; 

an excluding subunit operable to exclude, from the detected frames satisfying the
first condition, frames satisfying the exclusion condition [[from frames satisfying the first
condition]];

a selecting subunit operable to select, from the detected frames satisfying the
second condition, frames satisfying the relationship identified [[relation shown]] by the selection
condition [[from frames satisfying the second condition]], and

wherein each of the first condition, the exclusion condition, and the second condition
[[each are]] is an identifier to be used by one detecting subunit [[of one]] of the plurality of detecting
subunits [[to be used]].

Claim 7 (Currently Amended) The video processing apparatus of Claim 6 [[Claim 4]],
wherein when operating in accordance with [[a]] the extracted piece of specifying information
corresponding to the [[a predetermined]] type of the content identified by the obtained type
information, the specifying unit (i) detects, from all [[the]] frames [[present in]] of the content, a
plurality of large-caption start frames, each large-caption start frame, of the plurality of
large-caption start frames, being [[of which is]] a first frame of a series of frames of the content
during which a caption of a size larger than a threshold continuously appears in a predetermined
region, [[small caption frames in each of which a caption of a size smaller than a threshold appears
in a region other than the predetermined region, CM frames which constitute a commercial message,
and transition frames each of which is a first frame of a series of frames of similar images,]] and
(ii) specifies, as a presentation frame of the content, each large-caption start frame, of the
plurality of large-caption start frames, [[frame]] remaining after removing, from the plurality of
large-caption start frames, the small-caption frames having a caption of a size smaller than a
threshold appearing in a region other than the predetermined region [[and the CM frames from the
large caption start frames, and (iii) specifies as a start frame, for each presentation frame, a
closest preceding transition frame to the presentation frame]].

Claim 8 (Currently Amended) The video processing apparatus of Claim 7,
wherein when operating in accordance with [[a]] the extracted piece of specifying information
corresponding to [[a predetermined]] the type of the content identified by the obtained type
information, the specifying unit (i) detects, from all frames present in the content, [[excludes
frames which constitute a commercial message from all the frames present in the content, (ii)
detects from the remaining frames, large caption start frames each of which is a first frame of a
series of frames during which a caption of a size larger than a threshold continuously appears in
a predetermined region, small caption frames in each of which a caption of a size smaller than a
threshold appears in a region other than the predetermined region, and]] a plurality of transition
frames, each transition frame, of the plurality of transition frames, being (a) [[of which is]] a first
frame of a series of frames of the content of similar images, (b) silent frames of the content
having audio data below a predetermined volume level, (c) music-start frames of the content
having a first frame of a series of frames of the content having audio data representing a piece of
music data, or (d) speech-start frames having a first frame of a series of frames of the content
having audio data representing a speech of a specific speaker, [[(iii) specifies as a presentation
frame each frame remaining after removing the small caption frames from the large caption start
frames, and (iv)]] and (ii) specifies as a start frame of the content, for each presentation frame of
the content, a frame of the content closest [[preceding transition frame]] to the presentation frame
among all of the detected frames preceding the presentation frame.
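
Two operations recur in Claims 7 and 8: taking the first frame of each run of frames in which a feature continuously appears, and picking the detected frame closest to, and preceding, a presentation frame. A minimal sketch of both is given below, assuming (these are assumptions of the sketch, not part of the claims) that a per-frame detector has already produced a list of booleans and that detected frames are held as a sorted list of frame indices. The function names are hypothetical.

```python
from bisect import bisect_left
from typing import List, Optional, Sequence


def run_start_frames(flags: Sequence[bool]) -> List[int]:
    """Return the index of the first frame of every run of True values.

    flags[i] is assumed to be True when the feature (for example a large
    caption in the predetermined region) appears in frame i.
    """
    return [
        i for i, on in enumerate(flags)
        if on and (i == 0 or not flags[i - 1])
    ]


def closest_preceding(sorted_candidates: List[int], presentation: int) -> Optional[int]:
    """Return the largest candidate strictly before `presentation`, or None.

    `sorted_candidates` must be in ascending order.
    """
    pos = bisect_left(sorted_candidates, presentation)  # count of candidates < presentation
    return sorted_candidates[pos - 1] if pos > 0 else None
```

The binary search keeps each lookup logarithmic in the number of detected frames, which matters only if a content item yields many candidates; a linear scan would serve equally well for short content.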

Claims 9-15 (Cancelled) 

Claim 16 (Currently Amended) The video processing apparatus of Claim 1 [[Claim 2]], further
comprising[[:]] a playback unit operable to play back the content starting from a start frame of 
the content specified by the specifying unit. 

Claim 17 (Currently Amended) The video processing apparatus of Claim 16, [[further]]

wherein the video processing apparatus further comprises:

an index storing unit operable to store pairs of display times of each start frame
and presentation frame specified for a respective viewing segment, of the plurality of viewing
segments, by the specifying unit;

a display unit operable to display a presentation frame specified for each viewing
segment, of the plurality of viewing segments, by the specifying unit; and

a user-selection unit operable to select, in accordance with a user selection, at
least one of the displayed presentation frames [[displayed, in accordance with a user selection]], and



wherein the playback unit plays back the content starting from a start frame of a viewing 
segment, of the plurality of viewing segments, to which the user-selected presentation frame
belongs. 
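
For illustration, the index of Claim 17 can be pictured as a list of (start, presentation) display-time pairs, one per viewing segment, from which playback looks up the start time paired with the presentation frame the user picked. The sketch below assumes display times are integer milliseconds and that a presentation time identifies its segment uniquely; `SegmentEntry` and `playback_start_for` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SegmentEntry:
    """Display times of one viewing segment (assumed to be in milliseconds)."""
    start_ms: int
    presentation_ms: int


def playback_start_for(index: List[SegmentEntry], selected_presentation_ms: int) -> int:
    """Return the start time of the segment whose presentation frame was selected."""
    for entry in index:
        if entry.presentation_ms == selected_presentation_ms:
            return entry.start_ms
    raise KeyError(selected_presentation_ms)
```

For example, with `index = [SegmentEntry(0, 1500), SegmentEntry(60000, 62000)]`, selecting the thumbnail taken at 62000 ms would start playback at 60000 ms.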

Claim 18 (Currently Amended) The video processing apparatus of Claim 17, wherein the 
display unit displays the presentation frames specified for the plurality of viewing segments by 
generating a thumbnail image of each presentation frame of the presentation frames of the 
content and displaying the generated thumbnail images in list form. 

Claim 19 (Currently Amended) The video processing apparatus of Claim 17, [[wherein]]

wherein the user-selection unit stores the at least one of the selected presentation frames
[[frame]] as a reference image into the specifying information memory, and

wherein the specifying unit specifies the presentation frames by detecting frames of the
content which are similar to the reference image with respect to a location of a region in which a
caption appears.

Claim 20 (Currently Amended) The video processing apparatus of Claim 1, further
comprising: 

wherein the video processing apparatus includes a recording unit operable to obtain a 
content and type information of the content, and to record the content to a recording medium in 
correspondence with the type information, [[wherein]]

wherein after the recording unit records the type information and at least a part of the
content, the content obtaining unit sequentially obtains the part of the content from the recording
medium, and

wherein the specifying unit sequentially specifies a start frame present in the part of the
content obtained by the content obtaining unit. 

Claim 21 (Currently Amended) The video processing apparatus of Claim 1, further 
comprising: 

wherein the video processing apparatus comprises a recording unit operable to obtain a 
content and type information of the content, encode the content, and record the encoded content 
in correspondence with the type information, [[wherein]]

wherein, after the recording unit records the type information and encodes at least a part
of the content, the content obtaining unit sequentially obtains the encoded part of the content,
and

wherein the specifying unit obtains an analysis [[analyses]] of the encoded part of the
content conducted by the recording unit for the encoding, and sequentially specifies a start frame
present in the encoded part of the content using the obtained analysis [[analyses]].

Claim 22 (Currently Amended) The video processing apparatus of Claim 1, further 
comprising[[:]] an updating unit operable to obtain a new version of a piece of specifying
information, of the plurality of pieces of specifying information, that corresponds [[corresponding]]
to a specific type of content, and operable to record the new version of the piece of specifying
information to the specifying information memory.

Claim 23 (Currently Amended) The video processing apparatus of Claim 22, [[wherein]]

wherein the updating unit obtains the new version of the piece of specifying information
when the video processing apparatus is connected, via a communication network, to a provider
apparatus for providing specifying information, and judges [[judging]] that the new version of the
piece of specifying information is available, and

wherein the new version of the piece of specifying information is recorded to the
specifying information memory by updating, to the new version of the piece of specifying
information, a stored piece of specifying information, of the plurality of pieces of specifying
information, [[stored therein]] that corresponds [[corresponding]] to the specific type [[to the new
version]].

Claim 24 (Currently Amended) The video processing apparatus of Claim 23, wherein a [[the]]
judgment as to whether the new version of the piece of specifying information is available is
made each time the specifying unit processes the specific type of content. 
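
The update behaviour of Claims 22 to 24 (check for a newer piece of specifying information each time content of a given type is processed, and overwrite the stored piece if one is available) might look like the sketch below. The integer `version` field, the `fetch_latest` callback standing in for the provider apparatus, and the convention that it returns `None` when the provider is unreachable are all assumptions of this sketch; the claims do not say how availability is judged.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class VersionedInfo:
    """A stored piece of specifying information with an assumed version number."""
    version: int
    payload: str


def refresh_before_processing(
    memory: Dict[str, VersionedInfo],
    content_type: str,
    fetch_latest: Callable[[str], Optional[VersionedInfo]],
) -> bool:
    """Update the stored piece for `content_type` if a newer one is available.

    Returns True when the memory was updated.
    """
    latest = fetch_latest(content_type)  # hypothetical call to the provider apparatus
    if latest is None:
        return False  # not connected, or nothing offered for this type
    current = memory.get(content_type)
    if current is not None and latest.version <= current.version:
        return False  # stored piece is already up to date
    memory[content_type] = latest
    return True
```

Calling this function at the start of each specifying pass for a content type corresponds to making the availability judgment each time that type is processed.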

Claim 25 (Currently Amended) An integrated circuit for use in a video processing 
apparatus that specifies frames of content to be start frames of a plurality of viewing segments of 
the content, when segmenting [[a]] the content into the plurality of viewing segments, the video
processing apparatus having a specifying information memory storing a plurality of pieces of
specifying information [[each showing a feature of frames to be specified as start frames and]], each
piece of specifying information of the plurality of pieces of specifying information
corresponding to a different type of content[[,]] and including (i) a first condition identifying a
feature of frames of the content to be detected as candidates for presentation frames, each of the
presentation frames for being displayed as a representative still image of a respective viewing 
segment of the plurality of viewing segments, (ii) an exclusion condition identifying a feature of 
frames of the content to be excluded from being the candidates for the presentation frames, (iii) a 
second condition identifying a feature of frames of the content to be detected as candidates for 
start frames, and (iv) a selection condition identifying a relationship between a presentation 
frame of the content and a frame of the content that is to be selected as a start frame, the 
integrated circuit comprising: 

a content obtaining module operable to obtain a content; 

an information obtaining module operable to obtain type information identifying [[showing]]
the type of the obtained content; 

an extracting module operable to extract from the specifying information memory a 
piece of specifying information, of the plurality of pieces of specifying information, that
corresponds [[corresponding]] to the type of the content identified [[shown]] by the obtained type
information; and 

a specifying module operable [[to specify start frames present in the content]], in accordance
with the extracted piece of specifying information, to (i) specify the presentation frames of the
content by detecting, from all frames of the content, frames of the content satisfying the first 
condition and by subsequently excluding, from the detected frames satisfying the first condition, 
frames satisfying the exclusion condition, and (ii) specify start frames of the content by 
detecting, from all frames of the content, frames of the content satisfying the second condition 
and by subsequently selecting, from the detected frames satisfying the second condition, frames
satisfying the relationship identified by the selection condition with respect to the specified
presentation frames.



Claim 26 (Currently Amended) A video processing method for use by a video processing 
apparatus that specifies frames of content to be start frames of a plurality of viewing segments of 
the content, when segmenting [[a]] the content into the plurality of viewing segments, the video
processing apparatus having a specifying information memory storing a plurality of pieces of
specifying information [[each showing a feature of frames to be specified as start frames and]], each
piece of specifying information of the plurality of pieces of specifying information 
corresponding to a different type of content and including (i) a first condition identifying a 
feature of frames of the content to be detected as candidates for presentation frames, each of the 
presentation frames for being displayed as a representative still image of a respective viewing 
segment of the plurality of viewing segments, (ii) an exclusion condition identifying a feature of 
frames of the content to be excluded from being the candidates for the presentation frames, (iii) a
second condition identifying a feature of frames of the content to be detected as candidates for 
start frames, and (iv) a selection condition identifying a relationship between a presentation 
frame of the content and a frame of the content that is to be selected as a start frame, the video
processing method comprising [[the steps of]]:

obtaining a content;

obtaining [[a]] type information identifying [[showing a]] the type of the obtained content;

extracting, from the specifying information memory, a piece of specifying information, of
the plurality of pieces of specifying information, that corresponds [[corresponding]] to the type of
the content identified [[shown]] by the obtained type information; [[and]]

specifying [[start frames present in the content]], in accordance with the extracted piece of
specifying information, presentation frames of the content by detecting, from all frames of the
content, frames of the content satisfying the first condition and by subsequently excluding, from
the detected frames satisfying the first condition, frames satisfying the exclusion condition; and

specifying, in accordance with the extracted piece of specifying information, start frames 
of the content by detecting, from all frames of the content, frames of the content satisfying the 
second condition and by subsequently selecting, from the detected frames satisfying the second 
condition, frames satisfying the relationship identified by the selection condition with respect to 
the specified presentation frames.

Claim 27 (Currently Amended) A non-transitory computer-readable recording medium 
having a video processing program recorded thereon, the video processing program for causing a
device to specify frames of content to be start frames of a plurality of viewing segments of the
content, when segmenting [[a]] the content into the plurality of viewing segments, the device
having a specifying information memory storing a plurality of pieces of specifying information
[[each showing a feature of frames to be specified as start frames and]], each piece of specifying
information of the plurality of pieces of specifying information corresponding to a different type
of content[[,]] and including (i) a first condition identifying a feature of frames of the content to
be detected as candidates for presentation frames, each of the presentation frames for being 
displayed as a representative still image of a respective viewing segment of the plurality of 
viewing segments, (ii) an exclusion condition identifying a feature of frames of the content to be 
excluded from being the candidates for the presentation frames, (iii) a second condition 
identifying a feature of frames of the content to be detected as candidates for start frames, and
(iv) a selection condition identifying a relationship between a presentation frame of the content 
and a frame of the content that is to be selected as a start frame, the video processing program 
causing a computer to execute a method comprising the steps of:

obtaining a content;

obtaining [[a]] type information identifying the [[showing a]] type of the obtained content;

extracting, from the specifying information memory, a piece of specifying information, of 
the plurality of pieces of specifying information, that corresponds [[corresponding]] to the type of
the content identified [[shown]] by the obtained type information; [[and]]

specifying [[start frames present in the content]], in accordance with the extracted piece of
specifying information, presentation frames of the content by detecting, from all frames of the
content, frames of the content satisfying the first condition and by subsequently excluding, from 
the detected frames satisfying the first condition, frames satisfying the exclusion condition; and 

specifying, in accordance with the extracted piece of specifying information, start frames 
of the content by detecting, from all frames of the content, frames of the content satisfying the 
second condition and by subsequently selecting, from the detected frames satisfying the second 
condition, frames satisfying the relationship identified by the selection condition with respect to 
the specified presentation frames.